ABSTRACT

The MANPOWER program file consists of computer programs developed to facilitate
machine extraction and processing of Five-Year Defense Program and other manpower data.
The file includes routines for data file building, manipulation, and data transformation and
analysis.

A set of utility routines is described which allows the user to extract and execute any
program on the MANPOWER file without having to handle any of the source program
card-decks. Using these routines, the MANPOWER program file, and a data file, the entire
process of building and analyzing a data bank could be accomplished as a single job. This
system has been developed for use on Control Data 3400, 3600, and 3800 machines.
Modifications to some of the programs might be necessary if other equipment were used.
SUMMARY OF PROGRAMS


These computer programs were written in support of INS Study 31, "Estimation
of Navy Support Manpower Requirements (C)", (reference (d)). Although
some of the programs were developed specifically for building and processing
the study data, many are of a generally useful nature and probably could be
adapted easily to almost any computer. The data analysis routines accept
either card images or formatted tape input, making them available for the
user without modification. A short summary of each program contained in the
package, referred to as the MANPOWER PROGRAM FILE, follows.

PESORT  and  PE  are  file  building  and  editing  routines  used  for  processing 
raw  FYDP  data  into  a form  usable  for  analytic  purposes. 

MOMENTS is a generalized program that computes central moments,
intercorrelations, and multiple regression equations for a data matrix
with as many as 100 variables and unlimited observations. It can be used to
plot and print distributions and univariate histograms, as well as bivariate data
plots.

CLUSTER  is  a generally  useful  program  for  the  grouping  and  reduction  of 
data.  Algorithms  for  grouping  elements  similar  to  one  another  and  dissimilar 
to  other  elements  are  described.  Various  indexes  of  similarity  of  data  elements 
may  be  used  in  the  grouping  process. 

PEXGROUP, although written for the purpose of computing comparative
statistics between FYDP program elements and composites of program elements,
can be easily modified to compute statistics including part-whole correlation
coefficients and other measures used in test construction research.

REGFIT and ITERFIT are both nonlinear regression programs. Iterative
routines are used to determine least squares parameters for fitting a set of
data points and for extrapolating beyond the range of the data.

The utility routines facilitate card and tape handling for both the
FYDP analyst and the general user of these computer programs.




PROGRAM  PESORT 


GENERAL  DESCRIPTION 

This program processes "raw" FYDP data by scanning the file for logical
inconsistencies, and performs a preliminary sort of the data.*

Output 

• Listing  of  the  program  element  (PE)  dictionary; 

• Listing  of  the  preliminary  sort  categories; 

• Tally  of  number  of  PE's  per  sort  category; 

• Summary of data by service (Navy, Marine Corps);

• Summary  by  first  digit  of  PE  code; 

• Magnetic  tape  file  of  the  processed  FYDP  data. 

Limitations 

• Processes  each  year  sequentially  for  a given  group  of  PE's. 

INPUT  REQUIREMENTS 

• Dictionary. A dictionary of FYDP PE's containing the PE numbers
and their corresponding mnemonics.

• Group identifiers. Cards punched with a sort-category label are
interspersed with the PE dictionary deck to denote the beginning of
a category grouping of PE's.

• Data file. The card images, loaded onto magnetic tape, each represent
manpower counts and Total Obligational Authority (TOA) values for a single PE
for a single year. The hierarchical ordering of PE's within FY is required, i.e., all
PE's for a given fiscal year are adjacent records on the data file. A special
code, -1000, in the PE identification field serves as an end-of-year marker.
LUN 1 is designated as the data input file.

CARD  ORDER 
Set 

1.  Descriptive  information 

2.  Parameter  specification 

3.  Number  of  PE’s/category 

4.  Group  names,  PE  dictionary 

5.  FYDP  DATA  (on  tape) 

*This program has the capability of producing a final sort, but other programs
described later do this more efficiently. The logical consistency checks are
the primary functions performed by this program.
SPECIFIC CARD PREPARATION

Sequence  Contents/format

1.  Descriptive information

    • Any alphanumeric punches in columns 1-80.

2.  Parameter specification

    Columns  Contents
    1-3      Number of categories (NC)
    4-6      Blank
    7-9      Number of years of data on input file
    10-12    Any 3-digit code used as identification on output file
    13-15    Code controlling output file
               0 punch = no tape copy of summary (by category) data
               > 2 punch = tape copy will be produced on LUN 2
    16-18    Number of files to be skipped on input file
    19-21    Punch 2 if columns 13-15 > 1
    23-25    A blank or zero punch suppresses output tape when input
             (FYDP) file is sorted by PE's within categories. Punch 1
             if this output is needed. Output will be on LUN 2.
    25-27    Year corresponding to starting date of input data file,
             e.g., 62 = 1962.

3.  Number of PE's per category. A list with the number of
    elements corresponding to the value punched in columns 1-3
    of the Parameter specification card, [n_i | i = 1, NC].*

    Format: (25I3)

4.  Group names (labels) cards interspersed with PE dictionary.
    A label card, followed by n_i PE dictionary cards
    [n_i | i = 1, NC], is prepared as follows:

    Label: Punch any alphanumerics in columns 1-80 which will
    serve as a label for the PE grouping or category. Follow
    this by the set of n_i cards containing the PE name in columns
    1-28, with identification code in columns 29-34.

*This notation is used to indicate a number, n_i, which is the ith element of a
set of NC values of n.

5.  FYDP data file. The current version of PESORT requires that
    the FYDP data be formatted as follows:
Column/position  Contents

1-2      Fiscal year (e.g., 1969 would be punched as 69).
5-10     PE 6-digit code
11-20    TOA (non-discounted)
21-28    Navy - number officers
29-36    Navy - number enlisted
37-44    Marines - number officers
45-52    Marines - number enlisted
53-60    Civilians - direct hire - U.S.
61-68    Civilians - direct hire - foreign
69-76    Civilians - contract
77-80    Blank

These are card images loaded on 7-channel, 556-BPI, BCD magnetic tape as
unblocked (single) records. The data are "stacked" so that all PE's for a
given year are contiguous on the tape. A card punched with -1000 starting
in column/position 6 is used as an end-of-year marker. The last end-of-year
marker on the tape is followed by an end-of-file marker.
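
The fixed-column layout above can be illustrated with a short reader. This is a minimal sketch in Python, not part of the MANPOWER file: the function names, the dictionary keys, and the choice to read a blank numeric field as zero are all assumptions made for illustration.

```python
# Hypothetical field names; column ranges are 1-indexed and inclusive,
# copied from the Column/position table.
COUNT_FIELDS = [
    ("navy_officers", 21, 28),
    ("navy_enlisted", 29, 36),
    ("marine_officers", 37, 44),
    ("marine_enlisted", 45, 52),
    ("civ_direct_us", 53, 60),
    ("civ_direct_foreign", 61, 68),
    ("civ_contract", 69, 76),
]


def field(card, first, last):
    """Return the text punched in columns first..last, without blanks."""
    return card[first - 1:last].strip()


def parse_card(card):
    """Parse one 80-column card image; return None for an end-of-year marker."""
    card = card.ljust(80)
    pe_code = field(card, 5, 10)
    if pe_code == "-1000":  # marker punched starting in column 6
        return None
    record = {
        "fiscal_year": int(field(card, 1, 2)),
        "pe_code": pe_code,
        # Assumption: a blank numeric field is read as zero.
        "toa": float(field(card, 11, 20) or 0),
    }
    for name, first, last in COUNT_FIELDS:
        record[name] = int(field(card, first, last) or 0)
    return record


def split_by_year(cards):
    """Group parsed records into one list per fiscal year."""
    years, current = [], []
    for card in cards:
        record = parse_card(card)
        if record is None:
            years.append(current)
            current = []
        else:
            current.append(record)
    return years
```

Because every year's block ends with a marker card, `split_by_year` only emits a year once its marker has been read, which mirrors the "stacked" ordering the tape requires.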
SAMPLE FORMAT FOR HYPOTHETICAL FYDP DATA

Card number  Layout

             cc 1-41
 1           Test case, 5 PE's, 2 categories, 2 years
 2
 3
             cc 1-7
 4           Group 1
             cc 1-28    cc 29-34
 5           PE-A       000011
 6           PE-B       000021
             cc 1-8
 7           Group 11
             cc 1-28    cc 29-34
 8           PE-C       000031
 9           PE-D       000041
10           PE-E       000051
             cc/tp 1-2  5-10       Variable
11           68         000011
12           68         000021
13           68         000031
14           68         000041
15           68         000051
16                      -1000
17           69         000011
18           69         000021
19           69         000031
20           69         000041
21           69         000051
22                      -1000
23           EOF

NOTE: cc denotes card column; tp, tape position.
PROGRAM PE

GENERAL DESCRIPTION

An extension of Program PESORT, PE is the key program used to process
the raw FYDP data as input to the analytical routines. The data processing
consists of 3 phases: categorization/sorting, merging, and display. PE groupings
or categories are defined by the user, and FYDP data for each category
are aggregated into yearly composites as hard copy (printout).

Output 

• Listing  of  PE  dictionary,  elements  grouped  by  categories; 

• Magnetic  tape  file  of  the  FYDP  data; 

• Hard  copy  display  of  data,  summarized  by  PE  category  (see  table  1); 

• Merged FYDP (by category) tape file.

Limitations 

• A maximum  of  450  PE’s.  The  sum  of  PE  categories  and  composites 
of  the  categories  may  not  exceed  50. 

• A maximum of 15 years of FYDP data may be processed with a
single pass of the input file.

INPUT REQUIREMENTS

• Dictionary. (See PROGRAM PESORT.)

• Data file. (See PROGRAM PESORT.) Mount tape on LUN 1.

• Group identifiers. (See PROGRAM PESORT.)

• Composite  request  cards.  Composites  consisting  of  the  sums  of 
elements  of  different  categories  are  obtained  by  indicating  the  indexes  of  the 
categories  to  be  included  in  a composite  on  this  card. 

CARD  ORDER 
Set 

1.  Parameter  specification 

2.  Number  PE/category 

3.  Category  group  names,  PE  dictionary 

4.  Indexes  of  categories  for  each  composite 

5.  Names for components
SPECIFIC CARD PREPARATION

Sequence  Contents/format

1.  Parameter specification

    Columns  Contents
    1-3      Number of categories (N), (< 50)
    4-6      Number of composites to be formed (K), (< 25)
    7-9      Blank
    10-12    Number of FY's represented on input data file
    13-15    Code representing FY of earliest FYDP data on the
    16-18    Logical unit number (LUN) containing pre-processed FYDP
             data (by category) to be merged with output of this job.
    21       To eliminate list of PE dictionary, punch an integer > 1.

2.  Number of PE's per category. Punch, right justified, in
    fields of 3 columns, a maximum of 25 per card.

    Format: (25I3). Continue on another card if necessary.

3.  Category, group names, PE dictionary. (See PROGRAM
    PESORT for further details.)

4., 5.  Prepare these cards as described in PROGRAM PESORT.
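
The (25I3) format used for the PE-per-category card packs up to 25 right-justified integers into 3-column fields, continuing on further cards as needed. A minimal Python sketch of reading such cards follows; the function name is hypothetical, and treating a blank field as zero is an assumption about the input conventions rather than something the text states.

```python
def read_i3_fields(cards, count):
    """Read `count` integers from cards punched in 25 fields of 3 columns."""
    values = []
    for card in cards:
        card = card.ljust(75)  # 25 fields x 3 columns
        for k in range(25):
            if len(values) == count:
                return values
            text = card[3 * k:3 * k + 3].strip()
            # Assumption: a blank field counts as zero.
            values.append(int(text) if text else 0)
    if len(values) < count:
        raise ValueError("not enough cards for the requested number of values")
    return values
```

For a job with two categories holding 2 and 3 PE's, the single card `"  2  3"` would be read with `read_i3_fields(["  2  3"], 2)`.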
PROGRAM MOMENTS


GENERAL DESCRIPTION

This program provides a comprehensive statistical and graphical description
of a set of up to 100 variables for as many as 9999 observations.

Output 

• Number  of  observations  having  complete  data; 

• First  4 central  moments,  standard  error,  t values  for  skewness 
and  kurtosis  for  each  variable; 

• Zero-order intercorrelations among variables;

• Multiple regression equations for prespecified group of variables;

• Scatter plots of pairs of variables, with or without identification
of individual data points;

• Histograms (bar graphs) of univariate distributions.

Limitations

• Total number of variables before and after transgeneration (K) ≤ 100;

• Total number of observations per variable (N) ≤ 9999;

• If plots or histograms are desired, N × K < 10000;

• Number of variable format cards ≤ 9.

Special Features

• Scans for missing data (blanks are missing data). Drops an observation
from sample statistics if one or more variables is "blank";

• Allows for transformation of variables (e.g., log, square root);

• Variables may be labeled with a 6-character alphanumeric mnemonic.

CARD  ORDER 

1.  Descriptive  information  card 

2.  Parameter  specification  card 
*3.  Variable  name  card  or  cards 

4.  Variable  format  card  or  cards 
*5.  Transformation  card  or  cards 
*6.  Data  cards  (if  card  input) 

*7.  REGRES  card  or  cards 
*8.  PLOT  selection  card  or  cards 
9.  FINISH  card 

(Repeat  1 through  8 for  additional  subanalyses.) 


SPECIFIC  CARD  PREPARATION 


Sequence 


1.  Descriptive  information  card 

• Any  alphanumeric  characters  in  columns  1-72 

2.  Parameter  specification  card 

Columns  Parameter 

1-4  Number  of  observations 

5-7  Number  of  variables  before  transformation 

8 Number  of  variable  format  cards 

9-11  Number  of  plots  and  histograms 

12-13  Logical  unit  number  of  data  input  (other  than  1) 

15  Rewind  input  unit  when  punched  0 or  blank 

16-17  Number  of  transformation  cards 

18-19  Number  of  variables  after  transformation 

21  To  abort  correlation  & regression  analysis,  punch  1 

23  To  read  names  for  variables,  punch  1 

24-25  Number  of  regression  analyses 

*3.  Names for variables (see column 23 on Parameter specification card)

Columns  Parameter

1-6      Alphanumeric code for 1st variable
7-12     Alphanumeric code for 2nd variable
...
67-72    Alphanumeric code for 12th variable

Use additional cards to label all variables retained for analysis
after transformation (columns 18-19 on Parameter specification
card).

4.  Variable format card or cards

Punch format of data in columns 1-72 (column 1 of first card
must contain an open parenthesis).

*5.  Transformation card or cards (see columns 16-17 on Parameter
specification card)

Columns  Parameter
1-5      TRANS
7-9      Code indicating type of transformation (see TRANSFORMATION CODES)
10-12    Index number of the input variable
13-15    Index number of the output variable
16-25    A constant, if relevant to the transformation (punch decimal
         point, if applicable)

*6.  Data cards, if data not on tape

*7.  Regression analysis card or cards (if column 25 > 0 on Parameter
specification card)**

Columns  Parameter 
1-6  REGRES 

7-9  Index  number  of  the  criterion  (dependent)  variable 

10-12  Total  number  of  predictor  (independent)  variables 
13-15  Index  number  of  1st  predictor  variable 
16-18  Index  number  of  2nd  predictor  variable 
70-72  Index  number  of  20th  predictor  variable 

After  the  last  regression  analysis  desired,  insert  the  following  card: 

Columns  Parameter 
1-4  ENDR 

8.  PLOT  selection  card  or  cards 
Columns  Parameter 
1-4  PLOT 

7-9  Index  number  of  the  variable  as  ABSCISSA,  X 

10-12  Index  number  of  the  variable  as  ORDINATE,  Y 

13-15  YES  if  points  are  to  be  identified 

16-18  YES  if  page  divider  is  to  be  printed 

(Note: if X = Y, the program builds a histogram. If a bar graph is desired, punch
YES in columns 13-15.)

After last PLOT selection card, include the following card:

Columns  Parameter
1-4  ENDP

9.  FINISH card. More than one analysis may be processed as a single job.
Repeat cards 1 through 8 for additional subproblems.
After the last card of the last analysis, the following card
should be included:

Columns  Parameter
1-6  FINISH


* Optional depending upon contents of Parameter specification card.

** For more than 20 predictors, prepare as many REGRES cards as needed.
For exactly 20, or multiples of 20, predictor variables, follow the REGRES cards
with a blank card.
COMPUTATIONAL PROCEDURE


Central Moments

Let:

N = Number of observations

K = Number of variables

$X_{ij}$ = ith observation on jth variable

$S = \sum X$, $S_2 = \sum X^2$, $S_3 = \sum X^3$, $S_4 = \sum X^4$

(The summation operator is across N observations and the subscript denoting
the jth variable is dropped.)

The mean ($\bar{X}$) of a given variable is computed as $S/N$, and the standard deviation
is denoted SD.

Let:

$K_3 = S_3 - 3 S S_2 / N + 2 S^3 / N^2$

$K_4 = S_4 - 4 S S_3 / N + 6 S^2 S_2 / N^2 - 3 S^4 / N^3$

Skewness (G1) is computed as $K_3 / (SD \cdot SD^2)$.

Kurtosis (G2) is computed as

$$G_2 = \left\{ \frac{N\left[(N + 1) K_4 - 3(N - 1)(S_2 - S^2/N)^2\right]}{(N - 1)(N - 2)(N - 3)} \right\} \cdot \frac{1}{SD^4}$$

The standard error (SE) of G1 is $\left[ 6N(N - 1) / (N - 2)(N + 1)(N + 3) \right]^{1/2}$

The standard error of G2 is $\left[ 24N(N - 1)^2 / (N - 3)(N - 2)(N + 3)(N + 5) \right]^{1/2}$

Values for t, at d.f. equal to $\infty$, are given by

$t_{G1} = G1 / SE_{G1}$, and

$t_{G2} = G2 / SE_{G2}$
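
The power-sum expressions for K3 and K4 are the sums of cubed and fourth-power deviations from the mean, written so that only the running totals S, S2, S3, and S4 need to be accumulated in one pass. A minimal Python sketch follows; the function names are hypothetical and the code is an illustration of the identities, not the MOMENTS source.

```python
import math


def central_sums(x):
    """Return (mean, K3, K4) computed from the raw power sums S..S4."""
    n = len(x)
    s1 = sum(x)
    s2 = sum(v ** 2 for v in x)
    s3 = sum(v ** 3 for v in x)
    s4 = sum(v ** 4 for v in x)
    k3 = s3 - 3 * s1 * s2 / n + 2 * s1 ** 3 / n ** 2
    k4 = s4 - 4 * s1 * s3 / n + 6 * s1 ** 2 * s2 / n ** 2 - 3 * s1 ** 4 / n ** 3
    return s1 / n, k3, k4


def se_skewness(n):
    """Standard error of G1 for a sample of n observations."""
    return math.sqrt(6 * n * (n - 1) / ((n - 2) * (n + 1) * (n + 3)))


def se_kurtosis(n):
    """Standard error of G2 for a sample of n observations."""
    return math.sqrt(24 * n * (n - 1) ** 2 / ((n - 3) * (n - 2) * (n + 3) * (n + 5)))
```

Both standard errors depend only on N, so the t values can be formed as soon as G1 and G2 are available.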
Correlation and Regression Analyses

Zero-order intercorrelations among the ijth pair of variables for all pairs
are computed by:

$$r_{ij} = \frac{N \sum X_i X_j - \sum X_i \sum X_j}{\sqrt{\left[N \sum X_i^2 - (\sum X_i)^2\right]\left[N \sum X_j^2 - (\sum X_j)^2\right]}}$$

Multiple regression equations are constructed as follows:

• Intercorrelations among the M independent variables (R) are
selected from the matrix containing intercorrelations among all variables.

• A vector, v, of correlations of the dependent (d) and M independent
variables is also selected.

• The vector of standardized regression weights, $\beta$, is computed
from R and v.

• Raw score regression weights, $b_i$, are computed from the
standardized weights.

• The regression intercept, C, is determined by

$$C = \bar{X}_d - \sum_{i=1}^{M} b_i \bar{X}_i$$

• The multiple correlation coefficient, $r_{d \cdot M}$, is computed, where
$d \cdot M$ denotes the correlation of the M independent variables with the
dependent variable, d.

• An F test for the significance of $r_{d \cdot M}$ is given, which has
(M - 1), (N - M) degrees of freedom.

• The relative contribution of each variable to the predictable
variance, $r^2_{d \cdot M}$, is given for reference.
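
For orientation, the quantities named in the list above have well-known textbook forms when regression is carried out from a correlation matrix. The following is a sketch in conventional notation, assuming the program follows that standard method; $\sigma_d$ and $\sigma_i$ (standard deviations of the dependent and ith independent variable) and $\nu_1$, $\nu_2$ (the two degrees of freedom stated in the text) are symbols introduced here for illustration.

```latex
\begin{align*}
\beta &= R^{-1} v \\
b_i &= \beta_i \, \frac{\sigma_d}{\sigma_i}, \qquad i = 1, \dots, M \\
r_{d \cdot M}^{2} &= \sum_{i=1}^{M} \beta_i v_i \\
F &= \frac{r_{d \cdot M}^{2} / \nu_1}{\left(1 - r_{d \cdot M}^{2}\right) / \nu_2}
\end{align*}
```

Under these forms each product $\beta_i v_i$ is one variable's share of the predictable variance, which is one common way to report relative contributions.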


TRANSFORMATION  CODES 

SUBSCRIPT "OUT" indicates the index number corresponding to where the
transformed variable is stored; SUBSCRIPT "IN" indicates the index number of the
input variable which is to be operated upon. C is a numerical constant.


Code  Resulting Transformation

1     X(OUT) = SQRT(X(IN))
2     X(OUT) = ALOG(X(IN))
3     X(OUT) = 1.0/X(IN)
4     X(OUT) = SQRT(X(IN) + C)
5     X(OUT) = ALOG(X(IN) + C)
6     X(OUT) = X(IN)**C
7     X(OUT) = ARCSIN(X(IN))
8*    X(OUT) = X(OUT) * X(IN)
9*    X(OUT) = X(OUT) * X(IN) * C
10*   X(OUT) = X(OUT) + X(IN)
11    X(OUT) = X(IN) + C
12    X(OUT) = 1.0/(X(IN) + C)
13    X(OUT) = EXP(X(IN))

* Assumes X(OUT) was created by a previous transformation.
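
The transformation codes map naturally onto a dispatch table. The Python sketch below is illustrative only: the function name and argument names are hypothetical, ALOG is taken to be the natural logarithm as in FORTRAN, and code 6 is read as raising X(IN) to the power C, which is an assumption about the printed entry.

```python
import math

# Each entry takes (x_in, x_out, c) and returns the new X(OUT).
TRANSFORMS = {
    1: lambda x_in, x_out, c: math.sqrt(x_in),
    2: lambda x_in, x_out, c: math.log(x_in),
    3: lambda x_in, x_out, c: 1.0 / x_in,
    4: lambda x_in, x_out, c: math.sqrt(x_in + c),
    5: lambda x_in, x_out, c: math.log(x_in + c),
    6: lambda x_in, x_out, c: x_in ** c,  # assumed reading of code 6
    7: lambda x_in, x_out, c: math.asin(x_in),
    8: lambda x_in, x_out, c: x_out * x_in,
    9: lambda x_in, x_out, c: x_out * x_in * c,
    10: lambda x_in, x_out, c: x_out + x_in,
    11: lambda x_in, x_out, c: x_in + c,
    12: lambda x_in, x_out, c: 1.0 / (x_in + c),
    13: lambda x_in, x_out, c: math.exp(x_in),
}


def transform(code, x_in, x_out=None, c=0.0):
    """Apply one transformation code to a single value."""
    if code in (8, 9, 10) and x_out is None:
        raise ValueError("codes 8-10 need a previously created X(OUT)")
    return TRANSFORMS[code](x_in, x_out, c)
```

Codes 8 through 10 combine the input with an existing output variable, so a product of two variables would be built by first copying one of them into X(OUT) (for example with code 11 and C = 0) and then applying code 8.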
PROGRAM CLUSTER


GENERAL DESCRIPTION

This program performs a cluster or profile analysis on a set of measurements
for a given sample of individuals or units to identify any mutually exclusive
subgroups of either sampling units or variables measured.

Output 

• Means  and  standard  deviations  of  profiles  and  variables; 

• Standard  scores  for  profile  elements/variables; 

• Generalized  distances  between  elements/variables; 

• Congruency  coefficients  between  elements/variables; 

• Cattell's  Tp  coefficients  between  elements/variables; 

• Covariances  between  elements/variables; 

• Correlations between elements/variables;

• Clusters based upon

- correlations

- congruency coefficients

- $r_p$

• Composite  scores  based  upon  cluster  membership; 

• Mean  correlations  of  profiles  with  clusters; 

• Mean  correlations  between  clusters. 

Limitations 

• The  number  of  variable  format  cards  <9; 

• The product of sample size (N) and the number of elements per
profile (K) < 9,000.

Special  Features 

• Scans  for  missing  data  (blanks)  and  deletes  a score  profile  if  one  or 
more  elements  are  missing. 

• Standardizes  score  profile  if  requested. 
CARD ORDER


1.  Descriptive information card

2.  Parameter specification card

3.  Variable format card or cards

4.  Data

5.  FINISH card

(Repeat 1 through 4 for additional subanalyses. Unit 5 should
be "equipped" to 60, and unit 6 to 61.)

SPECIFIC  CARD  PREPARATION 


Sequence  Type

1.  Descriptive information card

    • Any alphanumeric data in columns 1-72.

2.  Parameter specification card

    Columns  Parameter
    1-3      Number of sampling units (N)
    4-6      Number variables (K)
    9        2 punch = calculate and print D^2
    12       1 punch = calculate and print congruency coefficients
             2 punch = print and use congruency coefficients as basis
             for clustering
    15       1 punch = print covariance matrix
    18       2 punch = calculate, print and use Pearson r for clustering
    19-21    Alternate tape unit (T) for data input (50<T>6)
    22-26    C(L) - lower limit on r for including a profile in a cluster
    27-31    C(U) - upper limit on r for excluding a profile from a
             cluster [C(L) > C(U)].
    36       1 punch = do not rewind alternate input tape
    39       1 punch = print standard score profiles
    41*      1 punch = punched output
    42-43    Number of variable format cards
    45       0 punch = as Cattell profile similarity coefficient
    46-53    Chi Square value for determining r_p (when omitted,
             coefficient is not printed)
    54-57    C(L) value for r_p
    58-61    C(U) value for r_p
    62-63    2 punch = transpose data matrix (cluster by variables,
             rather than by score profiles)
    65       1 punch = profiles are not to be standardized
    67       1 punch = uses absolute value of r as basis of clustering
    68-69    Minimum number of profiles needed to form a cluster
             (if blank, program sets it equal to 3.)

    *Not recommended.

3.  Variable format card or cards

4.  Data (cards or tape)

5.  FINISH card

    • Last card in deck must contain:

    Columns  Punch
    1-6      FINISH

COMPUTATIONAL  PROCEDURE 

Let X be a matrix of scores, having n rows representing the number of
subjects or sampling units, and k columns representing the number of variables
comprising the score profile for a given sampling unit. A given column of X may
be represented as a vector, $x_j$, where j = 1, 2, ..., k, which contains n
sampling units for the jth variable.

The standard score profile $Z_i$, representing the ith row or profile of X,
is given as

$$Z_i = [x_i - \mathrm{Mean}(x_i)] / \sigma(x_i)$$

This standardization procedure is equivalent to rescaling all elements in
the ith profile, such that the mean, or elevation, of the profile is zero and its
variance is unity.

At the discretion of the user, profile standardization may be omitted. (If one
is clustering variables, rather than sampling units, standardization is the
exception, rather than the rule.) A number of different similarity coefficients are computed.
A generalized distance function between the lth and mth elements in X (by
either rows or columns, depending upon the option elected), is computed by
$$D^2_{lm} = \sum_{i=1}^{n} x_{il}^2 + \sum_{i=1}^{n} x_{im}^2 - 2 \sum_{i=1}^{n} x_{il} x_{im}$$

The congruency coefficient, a measure of profile similarity, is computed as

$$G_{lm} = \frac{\sum_i Z_{il} Z_{im}}{\sqrt{\sum_i Z_{il}^2 \sum_i Z_{im}^2}}$$

where Z are scores standardized across sampling units.

Covariances and Pearson product moment correlations are computed by the
usual method.

The final similarity coefficient which may be computed is Cattell's $r_p$
(reference (a)):

$$r_p = \frac{E_k - \sum_i (Z_{il} - Z_{im})^2}{E_k + \sum_i (Z_{il} - Z_{im})^2}$$

where $E_k$ is twice the median Chi Square value (i.e., p = .5) for k degrees of
freedom ($E_k \approx 2k$). (A table showing the median Chi Square value for different
values of k, and the probability of exceeding $r_p$, is given in reference (a).)
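
The three similarity measures can be illustrated on a pair of profiles. The Python sketch below uses hypothetical function names and takes the profiles as plain lists; the value of E_k is supplied by the caller (the text notes it is twice the median Chi Square for k degrees of freedom), and the inputs to `cattell_rp` are assumed to be already standardized.

```python
import math


def distance_sq(a, b):
    """Generalized squared distance between two profiles."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def congruency(a, b):
    """Congruency coefficient: cross-products over root product of sums of squares."""
    cross = sum(p * q for p, q in zip(a, b))
    return cross / math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))


def cattell_rp(a, b, e_k):
    """Cattell's profile similarity coefficient for standardized profiles."""
    d2 = distance_sq(a, b)
    return (e_k - d2) / (e_k + d2)
```

Identical profiles give a squared distance of zero and therefore r_p = 1; as the distance grows past E_k the coefficient turns negative.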


The present illustration uses the Pearson product moment correlation
coefficient as the similarity measure between profiles and as the basis for clustering.

Search for an initial cluster begins by selection of a profile that correlates
at or above the value C(L). The limit of C(L) may be set at the value at which
a correlation coefficient based upon k independent variates is significant at level
$\alpha$. The profile having the greatest number of significant correlations with other
profiles is selected as a pivot. To the pivot is added the profile with the highest
average correlation with all members in the pivot list. To this first pair is added
the profile that correlates highest on the average with members of the list belonging
to the pair. The entire remaining (n-2) x (n-2) correlation matrix is searched
for the profile that has the highest average correlation with those already in the
cluster and this profile is added to the cluster. This procedure is repeated until
there are no other profiles whose average correlation with members of the
cluster equals or exceeds C(L).

Next, an upper limit C(U), which is a cutting point used to define
dissimilarity, is set to prevent cluster overlap. A suitable value is the point where a
correlation coefficient is significant at some p value, > p for C(L). Any profile
in the residual matrix, i.e., those not already selected for the (J-1)th cluster,
that correlates on the average > C(U), is deleted and is not considered for any
subsequent clusters. The clustering process continues until no further cluster of at
least m members can be formed (m is set by the user, where m ≥ 3). (See Lorr
(reference (b)) for applications of this method.)
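
The pivot-and-grow procedure described above can be sketched in a few lines. This Python version is an interpretation for illustration, not the CLUSTER source: the function name is hypothetical, ties are broken by lowest index, and the search is assumed to stop as soon as a cluster with fewer than m members is produced.

```python
def build_clusters(r, c_lower, c_upper, min_size=3):
    """Greedy clustering on a symmetric correlation matrix r (list of lists)."""
    available = set(range(len(r)))
    clusters = []
    while len(available) >= min_size:
        # Pivot: the profile with the most correlations at or above C(L).
        def n_significant(i):
            return sum(1 for j in available if j != i and r[i][j] >= c_lower)

        pivot = max(sorted(available), key=n_significant)
        if n_significant(pivot) == 0:
            break
        cluster = [pivot]
        candidates = available - {pivot}
        while candidates:
            # Average correlation of a candidate with current members.
            def mean_r(i):
                return sum(r[i][j] for j in cluster) / len(cluster)

            best = max(sorted(candidates), key=mean_r)
            if mean_r(best) < c_lower:
                break
            cluster.append(best)
            candidates.remove(best)
        if len(cluster) < min_size:
            break  # assumed stopping rule
        clusters.append(sorted(cluster))
        # Residual profiles averaging above C(U) with this cluster are
        # dropped from all later searches to prevent overlap.
        available = {
            i for i in candidates
            if sum(r[i][j] for j in cluster) / len(cluster) <= c_upper
        }
    return clusters
```

Profiles that fall between C(U) and C(L) with respect to a finished cluster end up in no cluster at all, which is how the two limits keep clusters mutually exclusive.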
PROGRAM PEXGROUP


GENERAL DESCRIPTION

This program computes part-whole correlations between a composite
(category) and each item (PE) in the composite.

Output

• $r_{i,c}$ (i = item; c = composite) - item, composite correlation

• $r_{c \cdot i}$ - correlation of item and composite with the variance in c, due
to i, held constant;

• List  of  the  data  for  each  PE  in  the  FYDP  data  bank; 

• Means  and  standard  deviations; 

• Count  of  data  points  for  each  PE. 

Input  Requirements 

• FYDP  data  file,  as  described  in  PROGRAM  PESORT.  Mount  tape  on 
LUN  1. 

• PE  dictionary,  as  described  in  PROGRAM  PESORT. 

Limitations 

• Will only perform an analysis of a single variable or a single
composite for a single pass of the data file.

CARD  ORDER* 

1.  Descriptive  information  card 

2.  Parameter  specification  card 

3.  Group  category  size  card 

4.  Group name cards, PE dictionary

5.  FINISH


* Repeat card (sets) 1-2 only for analysis of different composites of variables
from data file.


SPECIFIC CARD PREPARATION

Sequence  Contents/format

1.  Descriptive information

    • Any alphanumeric characters in columns 1-80

2.  Parameter specification card

    Columns  Contents
    1-3      Number of categories
    4-6      Number of PE's included in data file
    7-9      Number of composites to be formed
    10-12    Index number of the 1st variable in the composite
    13-15    Index number of the 2nd variable in the composite (in fields of
             3 columns for up to 8 variables)

3.  Group size card (25I3). The number of PE's comprising each
    category

4.  Group names - PE dictionary. See PROGRAM PESORT for
    detailed information.

5.  FINISH card. Punch FINISH in columns 1-6 to indicate end-of-job


COMPUTATIONAL  PROCEDURE 

Let C represent a composite of n items designated $\{x_i \mid i = 1, n\}$ and

$$C = \sum_{i=1}^{n} x_i$$

The correlation of an $x_i$ with C is given by the product moment correlation,
$r_{i,c}$.

The correlation of $x_i$ and C, independent of the variance of $x_i$, is
determined by

$$r_{c \cdot i} = \frac{r_{i,c}\,\sigma_c - \sigma_{x_i}}{\sigma_{c - x_i}}$$

where

$$\sigma_{c - x_i} = \sqrt{\sigma_{x_i}^2 + \sigma_c^2 - 2\, r_{i,c}\,\sigma_{x_i}\,\sigma_c}$$
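
The corrected part-whole correlation is algebraically the ordinary correlation between an item and the composite with that item removed. A minimal Python sketch, with hypothetical function names and population (divide-by-n) standard deviations assumed throughout, makes the relationship easy to check numerically.

```python
import math


def mean(v):
    return sum(v) / len(v)


def sd(v):
    """Population standard deviation (divide by n)."""
    m = mean(v)
    return math.sqrt(sum((a - m) ** 2 for a in v) / len(v))


def pearson(x, y):
    """Product moment correlation of two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / len(x)
    return cov / (sd(x) * sd(y))


def part_whole(items):
    """Return (r_ic, corrected r) for each item against the summed composite."""
    composite = [sum(obs) for obs in zip(*items)]
    sd_c = sd(composite)
    results = []
    for x in items:
        r_ic = pearson(x, composite)
        sd_i = sd(x)
        denom = math.sqrt(sd_i ** 2 + sd_c ** 2 - 2 * r_ic * sd_i * sd_c)
        results.append((r_ic, (r_ic * sd_c - sd_i) / denom))
    return results
```

The corrected value removes the inflation that arises because each item is itself part of the composite it is being correlated with.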


ALGORITHM

[Flowchart; only the entry label "Start" is recoverable.]
PROGRAM REGFIT


GENERAL DESCRIPTION

REGFIT is a generalized routine for computing the coefficients for an mth
order polynomial using a least squares regression method. The routine computes
polynomials for the entire range of 1 through m and evaluates goodness of fit
after each iteration.

Output

• Means and standard deviations of Y (the dependent variable) and X
through X^m (powers of the independent variable);

• Raw (b) and standardized (B) regression weights;

• Product-moment correlation between X and Ŷ (predicted or fitted
dependent variable);

• Y intercept, standard error of estimate;

• Various tests for goodness of fit;

• Ŷ values and confidence limits for user specified values of X, not
necessarily within the range of the original X values;

• Optional listing of X, Y, and Y-predicted values.

Input  Requirements 

• X and  Y data  points  (variable  format). 

Limitations 

• The  array  containing  original  X,  Y values  is  limited  to  50  variables 
and  20  observations  per  variable. 

• m<  10 

• Missing  data  (observations)  are  treated  as  real,  zero  values. 
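
Fitting polynomials of order 1 through m and checking goodness of fit at each order can be sketched as follows. This Python example solves the least squares normal equations directly and reports R squared; it is an equivalent formulation chosen for brevity, not a reproduction of REGFIT, and all function names are hypothetical.

```python
def polyfit(xs, ys, degree):
    """Least squares polynomial coefficients, constant term first."""
    n = degree + 1
    # Normal equations: a[p][q] = sum x^(p+q), b[p] = sum y * x^p.
    a = [[float(sum(x ** (p + q) for x in xs)) for q in range(n)] for p in range(n)]
    b = [float(sum(y * x ** p for x, y in zip(xs, ys))) for p in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda row: abs(a[row][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(a[row][k] * coeffs[k] for k in range(row + 1, n))
        coeffs[row] = (b[row] - tail) / a[row][row]
    return coeffs


def predict(coeffs, x):
    """Evaluate the fitted polynomial at x."""
    return sum(c * x ** p for p, c in enumerate(coeffs))


def r_squared(xs, ys, coeffs):
    """Proportion of the variance of Y accounted for by the fit."""
    y_bar = sum(ys) / len(ys)
    ss_res = sum((y - predict(coeffs, x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_bar) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


def fit_all_orders(xs, ys, max_degree):
    """Fit orders 1..max_degree; return (order, coefficients, R squared)."""
    fits = []
    for m in range(1, max_degree + 1):
        coeffs = polyfit(xs, ys, m)
        fits.append((m, coeffs, r_squared(xs, ys, coeffs)))
    return fits
```

Because `predict` accepts any X, the same coefficients can be used to extrapolate beyond the observed range, with the usual caution that high-order polynomials behave poorly outside the data.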

SPECIFIC  CARD  PREPARATION 

Sequence  Contents/format

1.  Title card. Any descriptive information (columns 1-80)

2.  Parameter specification

    Columns  Contents
    1-3      N (number observations)
    4-6      M (number variables)
    7-9      Input unit number of data file (blank = cards)
    11-12    FY of earliest data contained on input file (e.g., 63 = 1963).
    13-15    Blank or zero produces estimates rounded to nearest unit.
             Punch ≥ 1, output in E20.8 format

3.  Variable format card (columns 1-80)

4.  DATA (if on cards)

5.  FIT cards

    Columns  Contents
    1-6      Any non-blank characters
    7-9*     Index of X (independent) variable
    10-12    Index of Y (dependent) variable
    13-15    M, degree of highest order polynomial desired
    16-17    Punch NO if listing of X, Y and Y-predicted not wanted
    18-19    Any non-blank characters to add additional data point; X = Y = 0
    20-21    A code to indicate status of prediction option.

             Code  Prediction option**
             0     No prediction
             1     List of X values are to be read
             2     A range of X values are used.


* Punch zero (0) if X is a time series.

** See 5(A), 5(B), and 5(C).
Sequence  Contents/format

5.  FIT cards (continued)

    Columns  Contents

    22-23    Code (K) specifying the subset of observations selected
             from data bank. When K ≤ 0, all observations are used;
             when K ≥ 2, subsets of data are selected. See 5D.
    24-25    A code controlling computation of the standard error of
             Y predicted from X.

             Code      Result

             0, blank  SE² = σ²_Y (1 - R²)
             ≥ 1       SE² = σ²_Y (1 - R²) · (n / (n - M))
             < 0       Both sets of estimates are used

5(A)  Columns  Contents

      1-3      Number of X values to be read

5(B)  (10F8.0) X values, 10 per card. Use as many cards
      as necessary.

5(C)  Columns  Contents

      1-8      Smallest value of X (X min)
      9-16     Largest value of X (X max)
      19-24    N STEPS, the number of steps from X min to X max.
               Interval size = (X max - X min) / N STEPS

5(D)  For K ≥ 2, a series of K integers corresponding to years
      (e.g., 62 = 1962) is read using the format (25I3). A
      range of years may also be used as input for this option.
      When the integer appearing in the field of the second
      year is preceded by a minus (-) punch, the absolute
      value of this index determines the upper bound of the
      range. For example, the values 62 and -70 would include
      all observations in the range of 1962 through 1970
      as input for the analysis.
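The year selection of card 5(D) and the interval rule of card 5(C) can be illustrated with a short sketch. This is Python written for illustration only, not the original FORTRAN; the function names `expand_years` and `x_steps` are hypothetical, and the choice to include both end points of the X range is an assumption, since the card description does not say which end points are evaluated.

```python
def expand_years(fields):
    """Expand the year fields of a 5(D) card into a list of two-digit years.

    A negative entry closes a range that starts at the preceding entry,
    so the fields [62, -70] stand for 1962 through 1970.
    """
    years = []
    for value in fields:
        if value < 0:
            if not years:
                raise ValueError("a range needs a starting year before the minus punch")
            start, stop = years[-1], abs(value)
            # The starting year is already in the list; add the rest of the range.
            years.extend(range(start + 1, stop + 1))
        else:
            years.append(value)
    return years


def x_steps(x_min, x_max, n_steps):
    """Return X values from x_min to x_max (assumption: end points included)."""
    interval = (x_max - x_min) / n_steps  # interval size as defined on card 5(C)
    return [x_min + i * interval for i in range(n_steps + 1)]


print(expand_years([62, -70]))
print(x_steps(0.0, 10.0, 5))
```

The range form keeps the card short: two fields replace nine explicit year entries.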
[Card set 5 and its subsets may be repeated for analysis of additional X, Y
pairs of variables included in the original N x M data matrix.]

Sequence  Contents/format

6.  END FIT card -- a blank card to indicate that no more "fits" are required.

[Card sets 1-6 may be repeated for analysis of additional data matrices.]

7.  FINISH card -- punch FINISH in columns 1-6 to indicate end of run.


ALGORITHM

[Flowchart]

COMPUTATIONAL PROCEDURE


The following formulas were used to compute the coefficients for the
polynomials and to perform various statistical tests on the data.

Let

$s_{ij}$ = the ith sampling unit (FY) for the jth variable (category), where
i = 1, N and j = 1, M, and S is the N x M matrix of data elements;

x = a vector of N data points constituting the independent variable
(predictor); and

y = a corresponding vector containing the N elements of the dependent
variable (criterion); such that

$x \in S$ and $y \in S$, with $x \neq y$.

A polynomial of the form

(1)  $Y_i = b_0 + b_1 X_i + b_2 X_i^2 + \cdots + b_m X_i^m$

is generated by least squares regression. That is, m (m ≥ 2) vectors of powers
of X are generated such that the ith vector is equal to $X^i$.

If we let R indicate the matrix of intercorrelations of the m vectors of X,
and $V_i$ a vector of correlations between Y and each X vector, then the
standardized regression coefficients, $\beta_i$, may be obtained from the relationship

(2)  $\beta_i = R^{-1} V_i$

This provides (standardized) least squares estimates of the m coefficients
of the polynomial shown in (1). Raw score coefficients are then obtained
from the relationship

(3)  $b_i = \beta_i \, \sigma_y / \sigma_{x_i}$, where $\sigma_{x_i}$ represents the standard deviation of the ith
vector of scores.

The intercept, $b_0$, is determined by

$b_0 = M_y - \sum_{i=1}^{m} (M_{x_i} b_i)$, where $M_{x_i}$ represents the arithmetic mean
of the ith vector of scores.

Other  statistics  supporting  (1)  are  also  obtained. 

The squared multiple correlation of Y with $\{x^i \mid i = 1, m\}$ is given by

$R^2_{y \cdot x} = \sum_{i=1}^{m} \beta_i V_i$,

which can be interpreted as the proportion of the variance in Y jointly accounted
for by the different powers of X.
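The chain from correlations to raw coefficients in (1) through (3) can be traced with a small sketch. The Python below is an illustration under stated assumptions, not the REGFIT source: the helper names (`solve`, `corr`, `polyfit_standardized`) are hypothetical, the linear system is solved by plain Gaussian elimination rather than an explicit matrix inverse, and population (divide by N) standard deviations are used throughout, which does not affect the raw coefficients because the divisor cancels.

```python
import math


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def mean(v):
    return sum(v) / len(v)


def sd(v):
    mu = mean(v)
    return math.sqrt(sum((t - mu) ** 2 for t in v) / len(v))


def corr(u, v):
    mu_u, mu_v = mean(u), mean(v)
    cov = sum((s - mu_u) * (t - mu_v) for s, t in zip(u, v)) / len(u)
    return cov / (sd(u) * sd(v))


def polyfit_standardized(x, y, m):
    """Return (b0, [b1..bm], R squared) for a degree-m polynomial."""
    powers = [[t ** i for t in x] for i in range(1, m + 1)]       # X^1 .. X^m
    r = [[corr(p, q) for q in powers] for p in powers]            # matrix R
    v = [corr(p, y) for p in powers]                              # vector V
    beta = solve(r, v)                                            # equation (2)
    b = [bi * sd(y) / sd(p) for bi, p in zip(beta, powers)]       # equation (3)
    b0 = mean(y) - sum(mean(p) * bi for p, bi in zip(powers, b))  # intercept
    r2 = sum(bi * vi for bi, vi in zip(beta, v))                  # squared multiple R
    return b0, b, r2


xs = [1, 2, 3, 4, 5, 6]
ys = [1 + 2 * t + 3 * t * t for t in xs]
print(polyfit_standardized(xs, ys, 2))
```

With exact quadratic data the sketch recovers the generating coefficients to within rounding error, which is a convenient check on the three steps.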

The F test of (statistical) significance of $R_{y \cdot x}$ is determined by

$F = \dfrac{R^2_{y \cdot x} / m}{(1 - R^2_{y \cdot x}) / (n - m - 1)}$,

with (m-1) and (n-m-1) degrees of freedom. (Here, M = N.)

The standard error of estimate (at the mean of x) of $R_{y \cdot x}$ is

$SE_{y \cdot x} = \sigma_y \sqrt{1 - R^2_{y \cdot x}}$.

Two tests for indicating the gain afforded by adding another term to the
polynomial are:

$F = \dfrac{(R^2_{y \cdot x_m} - R^2_{y \cdot x_{m-1}})(n - m - 1)}{1 - R^2_{y \cdot x_m}}$,

which is distributed as F with 1 and (n-m-1) degrees of freedom; and the
"Wherry inequality," which compares

$\dfrac{1 - R^2_{y \cdot m}}{1 - R^2_{y \cdot (m-1)}}$ with $1 - \dfrac{1}{n - m - 1}$,

and which may also be used as a "rule of thumb" for interpreting the gain
afforded by using an m vs m + 1 degree polynomial (see reference (c)).
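As a rough numerical illustration of the goodness-of-fit ratios in this section, the sketch below evaluates the overall F, the F for the gain from one added term, and the two quantities that enter the Wherry comparison. It is written in Python for illustration only; the function names are hypothetical, and no decision rule is applied to the Wherry terms because the text here does not preserve the direction of the comparison.

```python
def f_overall(r2, n, m):
    """Overall F for a degree-m polynomial with squared multiple correlation r2."""
    return (r2 / m) / ((1.0 - r2) / (n - m - 1))


def f_gain(r2_m, r2_m_minus_1, n, m):
    """F (1 and n-m-1 d.f.) for the gain from raising the degree from m-1 to m."""
    return (r2_m - r2_m_minus_1) * (n - m - 1) / (1.0 - r2_m)


def wherry_terms(r2_m, r2_m_minus_1, n, m):
    """Return the ratio and the bound that the Wherry comparison sets side by side."""
    ratio = (1.0 - r2_m) / (1.0 - r2_m_minus_1)
    bound = 1.0 - 1.0 / (n - m - 1)
    return ratio, bound


# Example with n = 13 observations: linear R squared 0.50, quadratic R squared 0.75.
print(f_overall(0.75, 13, 2))
print(f_gain(0.75, 0.50, 13, 2))
print(wherry_terms(0.75, 0.50, 13, 2))
```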


Finally, the percent of $R^2_{y \cdot x}$ accounted for by a given $x^i$ is

$\dfrac{\beta_i V_i}{R^2_{y \cdot x}} \times 100$.

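Confidence bounds of $\hat{Y}$, given an x, are computed as follows:

Let:

$SE^2_{y \cdot x}$ = squared standard error at (mean) x based upon
m observations;

$x_i$ = deviation score, $X_i - \bar{X}$;

$C_{k \times k}$ = weighted variance-covariance matrix,
containing elements

$\begin{pmatrix} c_{11} & c_{12} & \cdots & c_{1k} \\ c_{21} & & & \vdots \\ \vdots & & & \\ c_{k1} & \cdots & & c_{kk} \end{pmatrix}$

where $c_{12} = c_{21}$, and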


n n 

c..=nZ  2 x 1 x 

1J  i l t l 


$v_k$ = a vector containing the k elements of x:

$x, x^2, \ldots, x^k$
The standard error of an individual forecast, $x_i$, represented by $s_{\hat{y}}(x_i)$,
is given by

y2 


SE2 

( i + S£y  • 2 +e) 

y- 

\ n ) J 

where $e = v_k^T C_{k \times k} v_k$. (Read T as transpose.)

The α-probability confidence limits of $\hat{Y}$, predicted by the regression of
Y on X, are obtained by

$\hat{Y}_{\alpha} = \hat{Y} \pm t_{\alpha} \, s_{\hat{y}}(x_i)$
PROGRAM ITERFIT


GENERAL DESCRIPTION

ITERFIT combines the capabilities of least squares (L.S.) curve fitting and
profile similarity analysis to determine the effect of adding a (disparate) data
point, near the origin (X = Y = 0) or elsewhere, on the shape of a curve describing
an original X, Y relationship. The "original" curve is described by a polynomial,
P0, of degree m (here, m = 2) determined by a least squares regression for the
original N pairs (X, Y) of data points. Another polynomial, P1, also of degree
m, constituting the first iteration, is generated by using N + 1 pairs of points.
Differences in the shapes of the profiles between the N pairs of data points generated
by P0 and P1 can then be compared with a zero-order product-moment correlation
coefficient, and the elevations, or distances between profiles, with a t test.
The routine uses the L.S. regression method, as described in PROGRAM REGFIT.
Comparisons among the various iterated profiles are also performed.

Output 

• R², F, regression weights (b), Y-intercept (b0), standard error of
estimate;

• Correlations (as indexes of goodness of fit) between original (N points)
and fitted values of Y for each iteration;

• Student's t statistic indicating the significance of the distance between
profiles;

• Test of the significance of the differences of iterated profile correlations
and elevations.

Input Requirements

• X and Y data points (variable format).

Limitations

• The array containing original (X, Y) values is limited to 50 variables
and 20 observations per variable. (Larger arrays can be accommodated
by recoding a number of DIMENSION statements.)

• 1 ≤ m ≤ 10

• Missing  observations  are  treated  as  real,  zero  values. 
CARD ORDER


1.  Title card

2.  Parameter  Specification  card 

3.  Variable  format  card 

4.  Data  (if  on  cards) 

5.  FIT  card(s) 

6.  END  FIT  card 

7.  FINISH  card 

SPECIFIC  CARD  PREPARATION 


Sequence  Contents/format

1.  Title card. Any alphanumeric information in columns 1-80.

2.  Parameter specification

    Columns  Contents

    1-3      Number of observations (n)
    4-6      Number of variables
    7-9      LUN of input data (blank assumes cards)
    10-12    Number of iterations

3.  Variable format card. Use columns 1-80.

4.  DATA (if on cards)

5.  FIT cards

    Columns  Contents

    1-6      Any non-blank characters
    7-9      Index of X (independent) variable
    10-12    Index of Y (dependent) variable
    13-15    M, the degree of the highest order polynomial desired
    16-17    BLANK
    18-20    Number of iterations (if blank, the value on the
             Parameter Specification card, columns 10-12,
             will be the determining value).


6.  END FIT card. A blank card following the last FIT card is used to
    signal the last sub-analysis of the data (as in 4).

7.  FINISH card. Punch FINISH in columns 1-6 to indicate the end of the job.

ALGORITHM

[Flowchart]

COMPUTATIONAL  PROCEDURE 

The regression coefficients and supporting statistics for the least squares
polynomials are computed as reported for PROGRAM REGFIT. Additional statistics,
comparing the original with the fitted points and fitted points with other
fitted data points, are computed as described below.

Preceding the summary of results for a given iteration, the coordinates of
the additional ((N + 1)th) data point are labeled as X = 0, Y = a, where a is a
value between zero and the intercept of the original linear regression equation.
The interval size of the distribution of values assigned to a is equal to

range (a) / k,

where k is the number of iterations, or steps, requested by the user, and
1 ≤ k ≤ 50. The labels appearing on the computer output are as follows.

The label R-FIT refers to the product-moment correlation between the
points of the original fitted quadratic and the iterated fitted quadratic equations
based upon the original N data points.

R(0) refers to the product-moment correlation between the original N data
points and the fitted data points resulting from a given iteration.

The T-FIT values are Student's t statistics (N - 1 d.f.) comparing the mean
differences of the values of the fitted quadratics with the original and iterated
equations.

T-(0) is analogous to T-FIT with the exception that the values of the
fitted quadratic are compared to those of the original N data points.

A matrix comparing the results of the different iterations is also displayed.
The values in the diagonal are the R-FIT values described above. The values
below the diagonal are the product-moment correlations among the fitted quadratic
values for each iteration. The entries above the main diagonal are Student's
t statistics for the test of the hypothesis that

$r_{i0} = r_{j0}$, where

i represents the fitted values generated from the ith iteration,
j represents the fitted values generated from the jth iteration, and
0 represents the original values of the N data points.

t is computed by

$t = (r_{i0} - r_{j0}) \sqrt{\dfrac{(n - 3)(1 + r_{ij})}{2 \, (1 - r_{ij}^2 - r_{i0}^2 - r_{j0}^2 + 2 r_{i0} r_{j0} r_{ij})}}$
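The t statistic for comparing two iteration profiles against the original data can be evaluated directly from the three correlations. The sketch below is a Python illustration, not the ITERFIT source; the function name `dependent_corr_t` and its argument names are hypothetical.

```python
import math


def dependent_corr_t(r_i0, r_j0, r_ij, n):
    """t for the hypothesis r_i0 = r_j0, where both share the original data (0)."""
    # Bracketed term of the denominator, built from all three correlations.
    det = 1.0 - r_ij ** 2 - r_i0 ** 2 - r_j0 ** 2 + 2.0 * r_i0 * r_j0 * r_ij
    return (r_i0 - r_j0) * math.sqrt((n - 3) * (1.0 + r_ij) / (2.0 * det))


# Two iterations that correlate 0.6 and 0.4 with the original points,
# and 0.5 with each other, on n = 23 observations.
print(dependent_corr_t(0.6, 0.4, 0.5, 23))
```

Swapping the two iterations changes only the sign of t, and equal correlations give t = 0, as the hypothesis requires.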
MANPOWER UTILITY ROUTINE

PROGRAM MANPOWER is a utility routine that organizes the 9 source
programs of the MANPOWER Program File. Execution of any program may be
accomplished by following the steps outlined below.

1.  Instruct the computer operator to mount CNA tape reel number 1834 on
logical unit number 14. This input tape is written in BCD with a
density of 556 bpi.

2.  Include  these  "systems"  cards: 

JOB,  etc 

% EQIP  14  = (SOMAR  PROGRAMS),  HI 
LOADMAIN,  14 

3.  Precede the control cards for each program of interest with a card containing
the name of the program punched in card columns 1 through 8.

4.  Following the last control card of the last program, include a card
with the characters ENDSOMAR in the first 8 columns.
PROGRAM INTERFAC


GENERAL DESCRIPTION

This utility routine will read and store an array of data and output selected
elements from the array. Variable formatting is utilized in both the input and
output phases. The routine may be used to reformat a data file, or to select, sort,
and/or merge up to a 3-way categorization of the data.

Input/Output

• Equal  length,  unblocked  BCD  records. 

Limitations 

• Uses  a single  FORTRAN  READ  Statement  to  input  an  array  of  N 
elements. 

• N is  the  product  of  KA,  KB,  and  KC,  defined  by  the  user  at  execution 
time. 

• N<103 
CARD  ORDER 


Set 

1.  Parameter  specification 

2.  Input  format 

3.  Data  (if  on  cards) 

4.  Output  selection 

5.  Output  format 


SPECIFIC  CARD  PREPARATION 


Sequence  Contents/format

1   Parameter specification

    Columns  Contents

    1-3      LUN of input
    4-6      LUN of output
    7-9      Number of elements in first subset of output array
    10-12    Number of elements in second subset of output array
    13-15    Number of elements in third subset of output array
    16-18    Number of elements in first subset of input array
    19-21    Number of elements in second subset of input array
    22-24    Number of elements in third subset of input array
    25-26    Index of first subscript used for ordering output
    27-28    Index of second subscript used for ordering output
    29-30    Index of third subscript used for ordering output

2   Input format -- punch FORMAT statement with
    an open parenthesis beginning in column 1.

3   Data (if on cards)

4   Output selection

    Columns  Contents

    1-3      Index of 1st element
    4-6      Index of 2nd element
    (etc., for up to 25 elements/card.)

    Set

    a.  (a_i | i = 1, NA)
    b.  (b_j | j = 1, NB), and
    c.  (c_k | k = 1, NC)

5   Output format -- punch FORMAT statement with
    an open parenthesis beginning in column 1.

ALGORITHM 


Data are represented in a 3-dimensional array, X(I, J, K). The ranges of the
subscripts are:

(I = 1, NI)

(J = 1, NJ)

(K = 1, NK)

Data are read into a linear array Y, which is EQUIVALENCED to X, by the
statement

READ (LUN, FORMAT) (Y(L), L = 1, N).

Here, N is defined as the product NI x NJ x NK. The relationship between
X and Y is illustrated in table 1.

See  section  on  indexing  procedure  for  input/output  arrangements  of  data. 
TABLE 1

X, Y RELATIONSHIPS FOR NI = 3, NJ = 2, NK = 2

       X             Y
(I)   (J)   (K)     (L)

 1     1     1        1
 1     1     2        2
 1     2     1        3
 1     2     2        4
 2     1     1        5
 2     1     2        6
 2     2     1        7
 2     2     2        8
 3     1     1        9
 3     1     2       10
 3     2     1       11
 3     2     2       12

Output  Phase 

An output hierarchy, determining the ordering of the indexes I, J, K, is
provided by the user. The digits 1, 2, 3 are used as codes for the indexes
I, J, K. For example, the configuration of indexes 2, 1, 3 would produce the
following output, for all levels of J, I, and K, for the data presented in table 1.

   Element X
(I)   (J)   (K)     Order of Output

 1     1     1            1
 1     1     2            2
 2     1     1            3
 2     1     2            4
 3     1     1            5
 3     1     2            6
 1     2     1            7
 1     2     2            8
 2     2     1            9
 2     2     2           10
 3     2     1           11
 3     2     2           12

Note that the configuration represents a lexicographical ordering of the
hierarchy of the range of subscripts.
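The two tables can be reproduced with a short sketch. The Python below is illustrative only and is not the INTERFAC source; `linear_index` and `output_order` are hypothetical names, and the linear index follows table 1 as printed (K varying fastest), which is taken here as given.

```python
from itertools import product


def linear_index(i, j, k, nj, nk):
    """1-based position L in Y of element X(I, J, K), following table 1 as printed."""
    return ((i - 1) * nj + (j - 1)) * nk + k


def output_order(hierarchy, ni, nj, nk):
    """List (I, J, K) triples in the order set by the hierarchy codes.

    hierarchy is a tuple such as (2, 1, 3): code 1 = I, 2 = J, 3 = K,
    listed from the slowest-varying subscript to the fastest.
    """
    sizes = {1: ni, 2: nj, 3: nk}
    ranges = [range(1, sizes[code] + 1) for code in hierarchy]
    triples = []
    for combo in product(*ranges):
        by_code = dict(zip(hierarchy, combo))
        triples.append((by_code[1], by_code[2], by_code[3]))
    return triples


# Configuration 2, 1, 3 on the 3 x 2 x 2 array of table 1.
for position, (i, j, k) in enumerate(output_order((2, 1, 3), 3, 2, 2), start=1):
    print(position, (i, j, k), "Y index", linear_index(i, j, k, 2, 2))
```

Treating the hierarchy as nested loops, outermost first, is what makes the output lexicographical in the chosen subscript order.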
PROGRAM ONTAPE


GENERAL DESCRIPTION

This program loads cards onto magnetic tape and produces a count of
the number of records transferred onto the tape (used as input to process Five
Year Defense Program (FYDP) data).

Output 

• 80-character,  unblocked,  BCD,  tape  records  on  Logical  Unit  Number 
(LUN)  1; 

• Card  count; 

• END-OF-FILE (EOF) after last record copied.

Input 

• Data  cards  to  be  copied; 

• END-OF-DATA card. (Card punched with X in columns 1-8 as a
signal that the input file is exhausted.)
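The card-to-tape copy can be pictured with a minimal sketch. This is Python for illustration, not the ONTAPE source: the function name `cards_to_tape` is hypothetical, an in-memory buffer stands in for the tape on LUN 1, the END-OF-DATA card is assumed to carry the letter X in each of columns 1-8, and the end-of-file mark is not modeled.

```python
import io

END_OF_DATA = "X" * 8  # assumption: X punched in each of columns 1-8


def cards_to_tape(cards, tape):
    """Copy card images to tape as 80-character records and return the count."""
    count = 0
    for card in cards:
        image = card.rstrip("\n")
        if image[:8] == END_OF_DATA:
            break  # sentinel card: the input file is exhausted
        tape.write(image[:80].ljust(80))  # pad or trim to one card image
        count += 1
    return count


tape = io.StringIO()
deck = ["FIRST DATA CARD\n", "SECOND DATA CARD\n", "XXXXXXXX\n", "NOT COPIED\n"]
print(cards_to_tape(deck, tape), "cards copied,", len(tape.getvalue()), "characters written")
```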